TupleRank and Implicit Relationship Discovery in Relational Databases
نویسندگان
چکیده
Google's successful PageRank brings to the Web an order that well re ects the relative importance of Web pages. Inspired by PageRank, we propose a similar scheme called TupleRank for ranking tuples in a relational database. Database tuples naturally relate to each other through referential integrity constraints declared in the schema. However, such constraints cannot capture more general relationships such as similarity. Furthermore, relationships determined statically from the database schema do not re ect actual query patterns that arise at runtime. To address these de ciencies of static TupleRank, we introduce the notion of query-driven TupleRank, which is based on a link structure that is dynamically constructed from a workload. Speci cally, database tuples are considered to be related if they are joined together by a query result tuple. The main challenge in supporting query-driven TupleRank is how to record multiway relations among tuples eÆciently for a large workload. We develop techniques to compute query-driven TupleRank accurately and eÆciently with low space requirement that is independent of the workload size. We further augment query-driven TupleRank so that it can better utilize the access frequency information collected from the workload. Preliminary experiment results demonstrate that TupleRank is both informative and intuitive, and they con rm the advantages of query-driven TupleRank over static TupleRank.
منابع مشابه
Exploring Implicit Relationships In a Relational Database
The Google search engine [BP98] has brought the enormous web pages an order that well reflects the degree of relevance and importance of the web pages with respect to search keywords. The technique used to determine the order is called PageRank [PBMW97], which origins from the idea of academic citation: a paper should receive more attention if it is cited by many (important) papers. By analyzin...
متن کاملTupleRank: Ranking Discovered Content in Virtual Databases
Recently, the problem of data integration has been newly addressed by methods based on machine learning and discovery. Such methods are intended to automate, at least in part, the laborious process of information integration, by which existing data sources are incorporated in a virtual database. Essentially, these methods scan new data sources, attempting to discover possible mappings to the vi...
متن کاملDiscovery of Data Dependencies in Relational Databases Lss8 Report 14 Discovery of Data Dependencies in Relational Databases Lss8 Report 14
Knowledge discovery in databases is not only the nontrivial extraction of implicit, previously unknown and potentially useful information from databases. We argue that in contrast to machine learning, knowledge discovery in databases should be applied to real world databases. Since real world databases are known to be very large, they raise problems of the access. Therefore, real world database...
متن کاملKDD – Knowledge Discovery in Databases
2 Database Management Systems 3 2.1 Three-Schema Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 2.2 Organisation of an Integrated Database System . . . . . . . . . . . . . . . . . . . . 5 2.3 Hierarchical and Network Databases . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.4 Relational Databases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ....
متن کاملAttribute-oriented Induction in Ob Ject-oriented Databases
Knowledge discovery in databases is the nontrivial extraction of implicit, previously unknown, and potentially useful information from data such that the extracted knowledge may facilitate deductive reasoning and query processing in database systems. This branch of study has been ranked among the most promising topics for database research for the 1990s. Due to the dominating influence of relat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003